Comparison of Affymetrix expression array summarization methods for reproducibility and consistency across studies

Authors

  • Xiaoyang Ruan
  • Ourania Kosti
  • Rado Goldman
  • Hongfang Liu
Abstract

The Affymetrix gene expression microarray is a widely used platform for differential analysis. The analysis pipeline includes five steps: background correction, normalization, PM-only correction, summarization, and differential analysis. Using publicly available microarray data, we compared the performance of five summarization methods: Median, Mean, Median Polish, Robust Linear Model, and Li-Wong. Our evaluation criterion was reproducibility between studies designed to answer the same scientific questions. Our analysis shows that mean value summarization yields a smaller number of transcripts with inconsistent fold change direction while maintaining reproducibility comparable to the more complex competing methods. We conclude that after raw data has been preprocessed by the most widely used pipeline (Robust Multi-array Average (RMA) background correction, quantile normalization, and PM-only correction), mean value summarization may convey a better representation of the true expression levels of target transcripts. The study suggests that the selection of bioinformatics algorithms needs to be application oriented. Sometimes a simple, intuitive approach is probably better.
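To make the compared quantities concrete, here is a minimal Python sketch of three of the five summarization methods (Mean, Median, Median Polish) and of a count of transcripts with inconsistent fold change direction. It is not the authors' code. It assumes each probe set arrives as a probe-by-array matrix of log2 intensities that has already been background corrected and normalized, and the function names (`summarize`, `median_polish`, `inconsistent_direction`) are hypothetical. Robust Linear Model and Li-Wong summarization are not shown.

```python
from statistics import mean, median


def median_polish(probes, max_iter=10, tol=1e-6):
    """Tukey median polish on a probe-by-array matrix (rows = probes).

    Returns one value per array: overall effect plus that array's effect.
    """
    resid = [list(row) for row in probes]
    n_rows, n_cols = len(resid), len(resid[0])
    overall = 0.0
    row_eff = [0.0] * n_rows
    col_eff = [0.0] * n_cols
    for _ in range(max_iter):
        # Sweep out row (probe) medians.
        row_med = [median(r) for r in resid]
        for i in range(n_rows):
            row_eff[i] += row_med[i]
            resid[i] = [v - row_med[i] for v in resid[i]]
        shift = median(col_eff)
        overall += shift
        col_eff = [c - shift for c in col_eff]
        # Sweep out column (array) medians.
        col_med = [median(resid[i][j] for i in range(n_rows))
                   for j in range(n_cols)]
        for j in range(n_cols):
            col_eff[j] += col_med[j]
            for i in range(n_rows):
                resid[i][j] -= col_med[j]
        shift = median(row_eff)
        overall += shift
        row_eff = [r - shift for r in row_eff]
        # Stop once a full sweep no longer changes the residuals.
        if max(abs(x) for x in row_med + col_med) < tol:
            break
    return [overall + c for c in col_eff]


def summarize(probes, method="mean"):
    """Collapse a probe-by-array matrix into one value per array."""
    columns = list(zip(*probes))
    if method == "mean":
        return [mean(col) for col in columns]
    if method == "median":
        return [median(col) for col in columns]
    if method == "median_polish":
        return median_polish(probes)
    raise ValueError(f"unknown method: {method}")


def inconsistent_direction(log_fc_a, log_fc_b):
    """Count transcripts whose log fold changes have opposite signs
    in two studies (lists aligned by transcript)."""
    return sum(1 for a, b in zip(log_fc_a, log_fc_b) if a * b < 0)


# Toy probe set: 3 probes (rows) measured on 2 arrays (columns).
toy = [[5.0, 7.0], [6.0, 8.0], [4.0, 6.0]]
mean_values = summarize(toy, "mean")
polish_values = summarize(toy, "median_polish")
```

On a perfectly additive toy matrix like this one, the three methods agree; they diverge when individual probes are outliers, which is where the choice of summarization method starts to matter.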

Similar resources

Integrating probe-level expression changes across generations of Affymetrix arrays

There is an urgent need for bioinformatic methods that allow integrative analysis of multiple microarray data sets. While previous studies have mainly concentrated on reproducibility of gene expression levels within or between different platforms, we propose a novel meta-analytic method that takes into account the vast amount of available probe-level information to combine the expression change...

Experimental Comparison and Evaluation of the Affymetrix Exon and U133Plus2 GeneChip Arrays

BACKGROUND Affymetrix exon arrays offer scientists the only solution for exon-level expression profiling at the whole-genome scale on a single array. These arrays feature a new chip design with no mismatch probes and a radically new random primed protocol to generate sense DNA targets along the entire length of the transcript. In addition to these changes, a limited number of validating experim...

Systematic order-dependent effect in expression values, variance, detection calls and differential expression in Affymetrix GeneChips®

MOTIVATION Affymetrix GeneChips are common 3' profiling platforms for quantifying gene expression. Using publicly available datasets of expression profiles from human and mouse experiments, we sought to characterize features of GeneChip data to better compare and evaluate analyses for differential expression, regulation and clustering. We uncovered an unexpected order dependence in expression d...

A new summarization method for Affymetrix probe level data

MOTIVATION We propose a new model-based technique for summarizing high-density oligonucleotide array data at probe level for Affymetrix GeneChips. The new summarization method is based on a factor analysis model for which a Bayesian maximum a posteriori method optimizes the model parameters under the assumption of Gaussian measurement noise. Thereafter, the RNA concentration is estimated from t...

A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae

RNA-seq has recently become an attractive method of choice in studies of transcriptomes, promising several advantages compared with microarrays. In this study, we sought to assess the contribution of the different analytical steps involved in the analysis of RNA-seq data generated with the Illumina platform, and to perform a cross-platform comparison based on the results obtained through A...

Publication date: 2011